Blind speech source localization, counting and separation for 2-channel convolutive mixtures in a reverberant environment

نویسندگان

  • Sayeh Mirzaei
  • Hugo Van hamme
  • Yaser Norouzi
چکیده

In this paper, the tasks of speech source localization, source counting and source separation are addressed for an unknown number of sources in a stereo recording scenario. In the first stage, the angles of arrival of individual source signals are estimated through a peak finding scheme applied to the angular spectrum which has been derived using non-linear GCC-PHAT. Then, based on the known channel mixture coefficients, we propose an approach for separating the sources based on Maximum Likelihood (ML) estimation. The predominant source in each time-frequency bin is identified through ML assuming a diffuse noise model. The separation performance is improved over a binary time-frequency masking method. The performance is measured by obtaining the existing metrics for blind source separation evaluation. The experiments are performed on synthetic speech mixtures in both anechoic and reverberant environments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Blind Separation of Speech Convolutive Mixtures via Time-Frequency Masking

An ideal binary masking, which specifies regions in the time-frequency domain whose concerned signal energy is greater than the interference signals is analyzed. The performance of the signal separation when these ideal binary masks are applied is evaluated. In the tests, these ideal masks remove almost all the interference from the other source of convolutive mixtures using simulated room impu...

متن کامل

Blind speech separation of moving speakers in real reverberant environments

In this paper we present a new on-line Blind Signal Separation method capable to separate convolutive speech signals of moving speakers in highly reverberant rooms. The separation network used is a recurrent network which performs separation of convolutive speech mixtures in the time domain, without any prior knowledge of the propagation media, based on the Maximum Likelihood Estimation (MLE) p...

متن کامل

Convolutive Blind Source Separation for Noisy Mixtures

The problem of separating convolutive mixtures of unknown time series arises in several application domains, a prominent example being the so-called cocktail party problem, where we want to recover the speech signals of multiple speakers who are simultaneously talking in a room. The room may be reverberant due to reflections on the walls, i.e., the original source signals sq(n), q = 1, . . . , ...

متن کامل

Removal of residual crosstalk components in blind source separation using LMS filters

The performance of Blind Source Separation (BSS) using Independent Component Analysis (ICA) declines significantly in a reverberant environment. The degradation is mainly caused by the residual crosstalk components derived from the reverberation of the jammer signal. This paper describes a post-processing method designed to refine output signals obtained by BSS. We propose a new method which us...

متن کامل

The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech

Despite several recent proposals to achieve blind source separation (BSS) for realistic acoustic signals, the separation performance is still not good enough. In particular, when the impulse responses are long, performance is highly limited. In this paper, we consider a two-input, two-output convolutive BSS problem. First, we show that it is not good to be constrained by the condition , where i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014